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We review the well-known polarization optics matrix methods, i.e., Jones and Stokes-Mueller 
calculus, and show how they can be formulated in terms of four-dimensional (4d) rotations of the 
four independent electromagnetic field quadratures. Since 4d rotations is a richer description than 
the conventional Jones and Stokes-Mueller calculi, having six rather than four degrees of freedom 
(DOF), we propose an extension of those calculi to handle all six DOF. 

For the Stokes-Mueller analysis, this leads to a novel and potentially useful extension that accounts 
for the absolute phase of the optical field, and which can be valuable in the areas where the optical 
phase is of interest, e.g. interferometry or coherent communications. As examples of the usefulness 
we use the formalism to explain the Pancharatnam phase by parallel transport, and shows its 
connection with the Berry phase. 

In addition we show that the two extra DOF in the 4d description represents unphysical trans- 
formations, forbidden for propagating photons, since they will not obey the fundamental quantum 
mechanical boson commutation relations. 



I. INTRODUCTION 

This paper presents an new formalism for polariza- 
tion calculus by the use of four dimensional (4d) rota- 
tions. This will then lead to extension of the classi- 
cal polarization-optics calculi (due to Jones and Stokes- 
Mueller, respectively), to be as geometrically rich as the 
set of four-dimensional rotations. We begin with an his- 
torical overview that will lead to a motivation and outline 
of this paper. 

A. Historical background 

The use of matrix methods to calculate the polariza- 
tion evolution for electromagnetic waves was, given the 
earlier discoveries within optics, a relatively late inven- 
tion, originally proposed by Jones [T] and Mueller [31 [3] 
in the 1940:ies. One reason was probably that before 
this time relatively few polarization devices were avail- 
able, and there was no demand for what we now refer to 
as "Jones calculus" and "Stokes-Mueller calculus" . How- 
ever all the knowledge was there: already in 1818 Young 
(following a proposition by Hooke in 1757, and also work 
by Arago in the early 19th century) showed that light was 
a transverse wave. Indeed both Maxwell [H Ch. 21, Par. 
812-813] and Poincare [5] treated polarization as two in- 
dependent transverse field components with orthogonal 
directions in their books from the early 1890s. Poincare 
even showed that the complex ratio of these two waves 
could be conveniently mapped (via a stereographic pro- 
jection) to the sphere that since then bears his name [5, 
Ch. 12]. In [B] a more modern discussion of this pro- 
jection is given. In fact, Poincare even showed how to 
propagate the complex amplitude ratio of the two field 
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components via a bilinear transformation [5J Ch. 12, 
Par. 158], which was an early precursor to today's Jones 
calculus. 

Paralleling those developments, the 1852 landmark pa- 
per by Stokes [7] is worth mentioning for two reasons; 
(i) it created the first framework for analyzing polarized 
and unpolarized light (leading to the first mathematical 
description of the Fresnel-Arago interference laws [H p. 
xv]), and (ii) for the introduction of the four parameters 
(nowadays referred to as Stokes parameters, which can be 
formed in to a 4-component Stokes vector) that enabled 
this framework. The Stokes parameters, which originated 
from experiments and were based on essentially measur- 
able quantities, seemed in the late 19th century quite 
disconnected from the transverse field components of the 
electromagnetic field as it was described by Maxwell [4 
and Poincare [5] . Later work in the first half of the 20th 
century, by e.g. Soleillet [Sj, Perrin [TO], and Mueller [2J, 
connected the Stokes parameters with the Cartesian co- 
ordinates of the Poincare sphere, and also showed how the 
Stokes parameters are linearly transformed as light prop- 
agates through birefringent and polarizing media. The 
stage was thus set for the matrix methods of relating 
input to output states in polarization systems. 

While Jones put forward the calculus named after him 
in a series of papers [T|, the history of the Mueller cal- 
culus is a bit more involved. Mueller had originally de- 
scribed his matrix method in the (since 1960 declassified) 
defense report [2], and then building on the earlier work 
by mainly Perrin [10] . A more thorough description was 
then given in Parke's 1948 PhD thesis [3], supervised by 
Mueller. 

The next important development was made by Falkoff 
and MacDonald [H], who connected the Stokes parame- 
ters with the density matrix (in classical optics also re- 
ferred to as the coherency matrix), which can be directly 
related to the Jones vector. In fact, this mathematical 
relationship was pointed out already 20 years earlier by 
Wiener [T21 Ch. 9] , although he was unaware of the phys- 
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ical relation to Stokes' four parameters. Important steps 
towards a more modern notation of polarization calculus 
were then made by Fano[T3"] and Barakat[14 , who intro- 
duced the Pauli matrices as a convenient bridge between 
Jones vectors/coherency matrices and Stokes vectors. 

We thus have two ways of describing polarization 
transformations; via 2x2 complex Jones matrices, or 
the 4x4 real Mueller matrices. The Jones matrices has 
the advantage that they can describe also the absolute 
optical phase change. Often this is of little relevance, 
however interferometry and coherent optical communi- 
cations are two notable and very important exceptions. 
The Mueller calculus has the advantage of also being able 
to describe partially polarized light. Since they describe 
the same phenomena, it should come as no surprise that 
there is a deeper, underlying mathematical relationship 
between Jones and Mueller matrices, first pointed out 
by Takenaka|15j. In the general case this stems from the 
known mathematical fact that the group of complex 2x2 
Jones matrices is isomorphic to the Lorentz group of 4 x 4 
transformations [T5]. The fact that the Lorentz group is 
mostly associated with relativistic dynamics has some- 
times obscured its role in Mueller calculus. Although the 
relationship have been discussed in the literature [T6HT9] 
it is not widely known in the optics community. 

Finally the use of quaternions have been proposed for 
polarization calculus, both for the use as Jones (state) 
vectors [50] as well as to simplify the calculation of Jones 
matrix cascades [21] . In this context, quaternion algebra 
can be used to replace, or computationally simplify, Jones 
or Mueller matrix multiplications. 

If we restrict the polarization transformations to pure 
birefringence (no polarization dependent losses), it is 
enough to use only 3 parameters of the Stokes vector. 
The Mueller matrices will then belong to the group 
of 3 x 3 rotation (orthogonal) matrices, and the sub- 
group with positive deteminants is commonly denoted 
O3 . This group is isomorphic to the set of unitary 2x2 
Jones matrices (the SU{2) group) [HI [16]. An general 
introductory discussion on the 0$ - SU(2) isomorphism 
can be found in [551 Ch. 4.10]. Both these groups have 3 
degrees of freedom (DOF) , since 3 parameters are needed 
to describe a three-dimensional (3d) rotation (e.g. two 
for the axis and a third for the angle of rotation). In 
addition, for Jones matrices, the absolute phase can be 
added as a 4th degree of freedom. 

In this work we will present a new description of elec- 
tromagnetic wave transformations, namely that of using 
the four-dimensional real space. This might seem like a 
trivial extension of the Jones calculus - merely separating 
the real and imaginary parts of the complex (Jones-) field 
components into four real vector components. However, 
as will be shown, and which is somewhat surprising, is 
that this leads to a more general description than both 
the classical Jones and Stokes-Mueller formalisms. 



B. Motivation and outline 

The description of a polarized electromagnetic wave 
in terms of its four real quadrature components was pi- 
oneered in the area of optical communications [551 154"] . 
where such a formalism is relevant since noise is (in 
the semi-classical model) linearly added independently 
to each quadrature, or dimension. The signal is then 
described by a real 4-component vector, with a length 
squared equal (or proportional) to the power. A lossless 
propagation of such a signal vector can then be described 
by the 4d real analog to the complex Jones matrix (see 
e.g. Eq. (9) in [51]). which has 4 independent parame- 
ters, or equivalently, DOF. Obviously, such a 4 x 4 trans- 
fer matrix is a four-dimensional (4d) rotation, since it 
will not change the length (or power) of the state vec- 
tor. However, 4d rotations form a 6-parameter group 
[25] . implying that the 4 DOF Jones matrices does not 
cover the full set of possible transformations in real 4d 
space |26].The question then emerges: these extra two 
DOF in the 4d rotations, what kind of transformations 
do they correspond to in Jones and Stokes space? This 
paper is addressing that question. 

As we will see the answer is very interesting, both 
from a mathematical and physical perspective. There 
is also a real practical relevance from the application 
of matrix methods to fiber optic transmission. We will 
show that the two extra DOFs of the 4d rotations cor- 
responds to forbidden transformations, i.e. polarization 
transformations that can be mathematically described by 
extensions of Jones and Stokes-Mueller calculi, but which 
does not have any physical realization in electromagnetic 
wave propagation [57]. But then why are they of interest? 
They are of interest for at least three reasons: 

Extension: They enable us to extend the existing cal- 
culi to cover a richer set of dynamics. For example, we 
will present the (perhaps first) mathematically consistent 
generalization of the Stokes vector to account for the ab- 
solute phase of the optical field. 

Commutation: The forbidden transformations com- 
mute with the allowed (conventional) ones. This is a 
unique property for these transformations that may have 
practical applications. 

Synthesis: Even if the forbidden transformations do 
not exist in nature, they can be synthesized in digital 
signal processing and waveform generation and might as 
such be of use in e.g. coherent optical transmitters and 
receivers. 

In this first paper on 4d polarization calculus we will 
limit the formalism to unitary (power conserving) trans- 
formations, which corresponds to purely birefringent me- 
dia. It is quite likely possible, but beyond the present 
scope, to generalize this work to non-unitary transfor- 
mations, as required for a description of e.g. polarization 
dependent losses. (N.B.: complex 2x2 Jones matrices 
have 8 DOF, real 4x4 matrices has twice as many, 16 
DOF. What do the extra 8 DOFs correspond to?) 

It is also worth noting that even if the main application 
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here is polarization calculus, there are numerous other 
areas within theoretical physics where the group theoret- 
ical properties of SU(2) and are relevant, as. e.g. 
particle physics, 2-state quantum systems, spinor theory, 
or coupled mode theory. Thus the presented theory will 
certainly have wider applications than polarization cal- 
culus. 

This article is organized as follows: In sections 2 and 3 
we will for reference review the well-known unitary Jones 
and Stokes-Mueller calculi and how they can be mapped 
to each other. In section 4 we will describe the corre- 
sponding 4d rotations. In section 5 we describe the gen- 
eral (6 DOF) set of 4d rotations including the forbidden 
and allowed transformations, and their parameterization. 
Then in sections 6 and 7 we will connect back to the Jones 
and Stokes calculi, respectively, and see how they can be 
extended to deal with both sets of 4d rotations. We will 
show that the 6 DOF 4d rotation group form 2 commut- 
ing subgroups with 3 DOF each, of which the classical 
Mueller matrices correspond to one. We will also explic- 
itly prove that the extra two DOF are forbidden, in the 
sense that they do not (contrary to the other four DOF) 
satisfy the fundamental boson commutation relations. In 
section 8 we will, as an example, apply the generalized 
Stokes method to explain the Pancharatnam phase, and 
finally in section 9 conclude. The main results are sum- 
marized in Table |III| A number of appendices provide 
supporting material, definitions and details not necessary 
for the main understanding. 

We employ the following notation conventions: Com- 
plex 2-vectors are denoted by boldface lowercase letters, 
as e.g. e. Real 3-vectors (e.g. Stokes vectors or rotation 
vectors) are denoted with lowercase vector-arrow nota- 
tion as e. Unit vectors have a hat, as e. Real 4- vectors are 
denoted with uppercase vector-arrow notation as E. Ma- 
trices are usually uppercase, and their dimension should 
be clear from the context. For example the unity matrix 
may be denoted / independent of dimension. A few spe- 
cific matrices are denoted by lower case greek letters as 
e.g. the Pauli matrices aj and the left-(right-) isoclinic 
generators pj 

II. THE JONES DESCRIPTION 

The transverse electric field components of a plane 
electromagnetic wave, propagating in the z— direction, 
has only two components, and can thus be described as 
the column vector 




We refer to this vector as the complex Jones vector. Of- 
ten the Jones vector is given in the frequency domain, 
so we will allow it to be frequency dependent, although 
in this paper this dependence is not important, so we 
will not write it out explicitly. The complex vector ele- 
ments describe the x— and y— components of the electric 



field, and their phase relative to some phase reference. 
For most purposes, e.g. in coherent optical signaling, it 
suffices to describe how the Jones vector is transformed 
when propagating in the system. A linear transmission 
medium can then be described by its complex transfer 
function, and for electromagnetic waves this generalizes 
to a complex 2x2 Jones matrix T, that relates the input 
and output Jones vectors via 

e(z) = T(z)e(0) (2) 

where e(0) denotes the input (z = 0) Jones vector to 
the system and z is an arbitrary output position. The 
transfer matrix T can be an arbitrary 2x2 matrix with 
complex coefficients, but in this analysis we will restrict 
ourselves to systems without any polarization-dependent 
loss (such as transmission fibers) where the signal power 
P = e*'e = e + e must be conserved. This leads to = 
T + so that T will be a unitary transformation matrix, 
belonging to the unitary group U(2). The subgroup of 
unitary 2x2 matrices with determinant=+l is called the 
special unitary group, or SU(2). 

Since our purpose is to relate the transfer matrix be- 
tween different descriptions of the signal vector, we need 
to parametrize the unitary transformation, and to do that 
will find it useful to describe the evolution of the signal 
within the medium by the differential equation 

^^ = He (3) 

dz 

where H is a matrix describing the transmission medium 
properties. Again, for lossless transmission we must have 
d(e + e)/dz = e + (H + — H)e = which implies that the 
matrix H = H + is Hermitian (i.e. equal to its conjugate 
transpose) . A useful parameterization of Hermitian ma- 
trices is via the Pauli spin matrices o~j (see definition, no- 
tation and general properties in appendix|A]). This means 
that H can be written as a linear combination of the four 
Pauli spin matrices: 

H = h a +h\o-\ + h 2 o- 2 + h 3 a 3 = h a I + h-a (4) 
or explicitly 

tt _ ( h + hi h 2 - ih 3 \ , . 

^ h2 + ih 3 h - hi J ' w 

Here hk are 4 real coefficients describing the medium, 
which may be formed into a scalar /i and a real 3- vector 
h = (hi, h 2 , /13). Assuming H to be constant enables the 
differential equation ^ to be solved using the matrix 
exponential as 

e(z) = exp[-iHz]e(0) = Te(0). (6) 

The matrix exponential can be expanded in a Taylor se- 
ries to prove that 

T = exp[-iHz] = exp(-ih Q z)U(zh) (7) 
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where 

OL 

U(d) — exp(— ic? ■ a) = [Jcos(a) — i— ■ <jsin(a)l (8) 

a 

denotes the generic special unitary transformation, or 
SU(2) group member, parameterized by the real 3- 
component vector a, the modulus of which is the scalar 
a = \a\. Physically the parameterization of the medium 
in ho and h can be identified as follows: The h coeffi- 
cient in the Jones matrix ^ corresponds to a common 
phase change of both field components, and the remain- 
ing coefficients define a unitary transformation U via |8]) 
that gives a polarization state change. The simplest non- 
trivial examples arc (i) h = (hi ,0,0), for which T models 
a linear retarder plate with phase retardation ±h±z in the 
x— and y— polarizations, (ii) h = (0, /12, 0) which is a re- 
tardation ±/i2-z between the linear polarization states at 
±45 degrees relative to x and y, and (iii) h = (0, 0,h 3 ) 
which is a rotator (circular retarder) that rotates x and 
y an angle h%z. In the next sections we will describe this 
polarization transfer matrix in the Stokes-Mueller and 4d 
signal spaces, respectively. 



III. THE STOKES-MUELLER DESCRIPTION 

The 4-component Stokes vector corresponding to the 
Jones vector e has real components given by the scalar 



e + a e 



e + e (which is simply the optical power), 
and the real 3- vector e = e + ae. However, for fully polar- 
ized light and absence of polarization-dependent losses, it 
is sufficient to consider the evolution of the vector e only. 
As a consequence the transformation (Mueller) matrices 
will be 3 x 3. Explicitly, we can express e in the complex 
field components as 



2 ,2Re(e :c e;),-2Im(e :c e;)] < 



(9) 



Note that any common or absolute phase of e x and e v 
will not be modeled by the Stokes vector e. Since only 
the direction of the Stokes vector e will change it is of- 
ten described as a point on a sphere, called the Poincare 
sphere. We may then write the evolution equation for e 
by using the definitions and (3|4) to obtain 



de 
dz 



e + i(H + a - aH)e = 2h x e 



(10) 



coefficients this equation can be integrated, again using 
the matrix exponential, to 

e(z) = exp{z2hx)e(0) = M(z2h)e(0) (12) 

where M denotes the Mueller matrix, which relates the 
input and output Stokes vectors. In this case when we 
have no polarization-dependent losses, the Mueller ma- 
trix is a rotation matrix, generally expressed as 



M(a) 



I + sin(a)- 



exp [a x J 
1 — cos(a) 



-(ax) 



I cos(a) + sin(a)(dx) + (1 — cos(a))(ad-) (13) 

which describes a rotation an angle a = \a\ around the 
unit vector a — a /a. Therefore the Mueller matrix M 
is often described geometrically with the Poincare sphere 
rotating around the birefringence vector h. The set of 
real n x n matrices M preserving a real vector length 
must fulfill M _1 = M', and forms the orthogonal group 
O3. The subset of O3 with determinant =+1 forms the 
group. 

By comparing the Jones transfer matrix ([8| with the 
Mueller matrix (131 we have a mapping between the 



two formalisms that illustrates the isomorphism between 
the O3 matrices R(a) and the SU(2) matrices U(a). 
However the Jones description (although not the SU(2) 
group) is more complete in that it may also contain the 
absolute phase change on the wave from the medium (the 
ho coefficient in ([8]), which is absent from the Stokes- 
Mueller approach. On the other hand, the Stokes-Mueller 
approach facilitates the geometrically pleasing, and also 
intuitive, interpretation of rotating polarization vectors 
on the Poincare sphere. 

Finally we should mention the derivation made in the 
end of A ppen dix [A] which connects the coherency matrix 
ee + , see (A5|, and the corresponding Stokes vector e via 



ee + = 



P 



(14) 



If we have two different Jones vectors e 12 with corre- 
sponding Stokes vectors ei,2 and powers Pi. 2 = |ei,2| we 
can use this relation to obtain the useful scalar product 
(or projection) formula 



\etei\ 2 



P1P2 



ei • e 2 



(15) 



where hx denotes the cross product operator 



hx 




(11) 



The evolution equation ( 10 ) has the well-known geo- 



metrical interpretation of describing a vector e rotating 
around an axis directed along h. In the case of constant 



connecting Jones and Stokes state vectors. 

So far we have done nothing new, but the preceding 
relations between the Jones and Stokes-Mueller descrip- 
tions are well known, and can be found in numerous 
books and articles on the topic, e.g. [T71 [28, 29 . We 
provide them for reference and to enable easy connec- 
tions with a third and less known description, based on 
the four-dimensional Euclidean space that will be intro- 
duced in the next section. 
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IV. FOUR-DIMENSIONAL SIGNAL SPACE 
DESCRIPTION 

Communication theory analyze signals in a signaling 
space that is an n-dimensional vector space containing 
the set of transmitted signals (the signaling constella- 
tion), as a set of discrete points in n-d. The received 
signals belong to this set, possibly perturbed by distor- 
tions, rotations, and noise. The geometric description 
of signals in signal space is valuable, because the Eu- 
clidean distances between the constellation points will 
directly translate into robustness to additive noise [30] . 
For coherent optical communications the signaling space 
is four-dimensional (4d), so a real 4d description of the 
electric field is in many cases preferred over the Jones or 
Stokes-Mueller formalisms in this context (23j [JU [30] . 

The Jones description can be taken to an equivalent 
4d description by expressing the complex Jones vector e 
as the real 4- vector 



E(z) 



/ Re(e*) 
lm(e x ) 
Re(e y ) 

V Im ( e y) 



(16) 



A Jones transformation such as ^ can now be written 



as 



E(z) = N(z)E(0) 



(17) 



where N(z) is a real 4d transformation matrix. As we 
noted for the Jones and Stokes descriptions, the absence 
of polarization-dependent loss or gain restricts the power 
P = E l E to be conserved and thus N T N = I so the N 
is a 4d orthogonal matrix, belonging to the O4 group. 
In order to identify the elements of N with those of the 
Jones matrix T in ^ we study the evolution equation 
for the vector E, i.e. 



dE 

dz 



= -KE 



(18) 



for some matrix K describing the medium. Since P must 
be constant with respect to z, we can show that K should 
satisfy K 1 — —K, i.e. K should be skew- symmetric. This 
means that its four diagonal elements vanish, and only 
the 12 off-diagonal elements of K remain. By separating 
([3]) in real and imaginary parts, and using Q we find 
that the matrix K equals 



K 



( 


-(ho + hj) 


h 3 


-h 2 


ho + hi 





hi 


ha 


-ha 


-hi 





~(ho- 


V hi 


-ha 


h - hi 






(-i)(h Doa + hxDxa + h 2 D 23 ) + ih 3 D, 



30- 



(19) 



Here Djj~ with j, k £ {0, 1, 2, 3} denote the 16 Dirac ma- 
trices. The Dirac matrices are 4x4 matrices with com- 
plex coefficients that are a 4d generalization of the Pauli 



spin matrices, being defined via the Kronecker product 
as Djk = <Jj ® o~k- In appendix [B] they are tabulated 
and their basic properties described - ^ In analogy with the 
previous sections we can integrate ( 18 ) if the hk coeffi- 
cients are constant, and describe the 4d rotation using 
the matrix exponential as 



E(z) = exp(-zK)E(0). 



(20) 



This description can be further simplified, but for that 
we need to describe the general properties of 4d rotation 
matrices, which will be done in the next section. 

It should be mentioned that the Dirac matrices have 
been used previously in connection with polarization op- 
tics. The recent paper by Tahir et al. [jJJ, used the defi- 



nition ( Bl ) of the Dirac matrices to express the coherency 
matrices. However the 4d rotation description for polar- 
ization transformation that we pursue here was not dis- 
cussed. Nontheless, that connection was indeed made in 
another recent paper, by Baumgarten |32j . That work 
applied a set of real Dirac matrices (obtained by replac- 
ing (73 with io~ 3 in our definition (Bl I, and pioneered by 



Majorana in the 1930:ies [33]), both to describe relativis- 
tic electromagnetics, as well as to connect them to (3d) 
rotation matrices and Lorentz transformations. Baum- 
garten's work could thus probably help with a future ex- 
tension of the present work to the non-unitary case with 
polarization-dependent losses, where the Mueller matri- 
ces belong to the Lorentz group. 



V. PARAMETRIZATION OF 4D ROTATIONS 



The argument in the preceding section, that a rotation 
should preserve the vector length, can be generalized to 
show that a rotation matrix R — exp(L) in n-d can be 
expressed as the exponential of a skew-symmetric matrix 
n x n matrix L. Such a matrix has elements = — Lji 
so that the diagonal elements must be zero, and the n(n— 
l)/2 elements above the diagonal is enough to describe 
all degrees of freedom. In the special case of 4d we have 6 
degrees of freedom, i.e. we need 6 parameters to describe 
an arbitrary 4d rotation. It is useful to use a subset of the 
Dirac matrices to describe L, and the 6 skew-symmetric 
Dirac matrices are the Dij where one (but not both) of 
i and j equals 3. Note that these are the Dirac matrices 
with purely imaginary elements, so they are multiplied 
with the i in the below formulas to be made real. In 
appendix [C] we present a complementary discussion of 
4d rotations and their nomenclature. 

We will thus introduce the following 6 matrices as the 
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4d rotation basis (notation will be motivated later): 



for all j and k. Note that multiplications within each 



Pi 



-i)D 



13 



92 = (~i)D 2 





-1 








1 




















1 


Vo 





-1 





f° 





■ 


-1 








1 








-1 








\1 












/ 10 
1 
-10 
\ -10 



A3 = (i)D 30 
which forms the vector p = (pi, P2, p 3 ) and 
Ai = (i)D a3 



(21) 



(22) 



(23) 



( 1 
-10 
1 

V o o-io 



A, 



H)£>3 



(i)D 



31 



-1 
0-10 
10 
10 

/ 1 
-1 
-10 

V o i o o 



(24) 



(25) 



(26) 



which forms the vector A = (Ai,A2,A3). An arbitrary 
real skew-symmetric 4d matrix L can thus be written as 
a linear combination of these matrices, and an arbitrary 
rotation 

R(a, 0) = exp(L) = exp(-a • p — j3 ■ A), (27) 

i.e. as a linear combination of the six above matrices 
where a, (3 are two arbitrary, real, three- vectors param- 
eterizing the rotation. The negative sign is chosen to 
conform with the previous sections. We can thus call the 
six matrices p, A the generating matrices for 4d rotations. 
We may also observe that the following relations hold: 



p 2 k = x 2 = -I Vk 
P1P2P3 = -P2P1P3 = I 

A1A2A3 = — A2A1A3 = /, 



(28) 



where / is the 4d unity matrix. These multiplication 
rules can be used to show that matrices formed by a 
linear combination of the pk and the unity matrix form a 
multiplicative subgroup to the group of all real 4d skew- 
symmetric matrices. The same holds for the group of 
matrices formed by a linear combination of A& and the 
unity matrix. Quite surprisingly these two subgroubs 
commute, since 



subgroup is non-commuting, according to ( 28 ) . This has 
important consequences for the rotations; since we may, 
for commuting operators, decompose the matrix expo- 
nential to the matrix product 



where 



R — RrRl — RlRr 

Rr{o) = exp(-a • p) 
R L 0)=esxp(-0-X). 



(30) 

(31) 
(32) 



This implies that we can decompose an arbitrary 4d rota- 
tion in to two commuting subgroups, which are often de- 
noted right-isoclinic, Rr and left-isoclinic, Rl, rotations 
respectively |25.. Thus any 4d rotation can be written 
as one right-isoclinic rotation followed by a left-isoclinic 
rotation, or vice versa. 

The isoclinic rotations can be written in a more explicit 
form by evaluating the matrix exponential and using 



(d? • p) 2 = —Id 2 



(33) 



to obtain 



E 

fc=0 



Rr = exp[— a ■ p\ = exp[— ad ■ p\ = 
{—ad ■ p) k 



k\ 



a 2 a 3 
= I - (a • p)a - I— + {d- p) — 



= Jcos(a) — (a • p) sin(a). 
Analogously, we find for the left-isoclinic rotations 
R L = Jcos(/3) - 0- A)sin(/3). 



(34) 



(35) 



Eqs. ( 30|34|35 ) give us a full, 6 DOF parameterization 
of the 4d rotations. 

We can now return to Sec. |IV| and identify the ele- 
ments ho, h from the matrix K in (19) with the general 



PkXj — XjPk 



(29) 



4d exponent L. This gives a = h and /3 = (ho, 0,0). 
We can thus express the 4d signal (20) in terms of 4d 
rotations as 

E(z) = exp(-zh ■ p) exp{-z(h , 0, 0) • X)E(0). (36) 



This completes the description of polarization transfor- 
mations in terms of 4d rotations, and the result is that 
we now have three alternative descriptions of polarization 
transformations. These are the complex Jones calculus 
in Eq. (|2), the real Stokes-Mueller description in Eq. 
(12) and the real 4d rotation description from Eq. ( |36[). 
Those have, for convenience been summarized in Table [Ij 

The above concludes a relatively straightforward ex- 
tension of the Jones calculus to 4d, that in itself might 
be useful in e.g. fiber communication analysis 26], but 
it provides nothing really physically novel. However, the 
remainder of this paper will be devoted to the 4d rota- 
tions described by the left-isoclinic matrices A2 and A3, 
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TABLE I. Relation between the conventional Jones, Stokes-Mueller and 4d vector descriptions of electromagnetic transforma- 
tions. The transformation is characterized by four parameters; an absolute phase shift ceo and a polarization change described 
by the real 3-vector a = act. The vectors a, p, X are three vectors forme d by th e Pauli spin matrices, and the left- and right- 
isoclinic 4d rotation generators, respectively, which are defined in Eqs. ( 21)26 1. Note that the absolute phase shift is absent 
from the Stokes-Mueller description. 





Signal Vector 


Input / Output Transformation 


Transfer Matrix 


Jones 


-(:) 


Gout — TG-'iyl 


T = exp[— iao — id ■ a] = 

exp(— iqo)[/cos(q) — id ■ irsin(a)] 


Stokes-Mueller 


e = 


(\e*\ 2 -\ey\ 2 ) 
2Re{e x e* y ) 




e ut = Mein 


M =exp[2ax] = 

Jcos(2a) + sin(2a)a x +(1 — cos(2a))a(a-) 


Four-dimensional 


/Re(e^ 

J = Im( e «) 
Re(e y ) 

\lm(e y )J 




-Eout = RE\ n 


R — exp[— (ao, 0, 0) • A — a • p\ = 

exp(— aoAi)[/cos(o:) — a • psin(a)] 



which seem to have no correspondence in the classical 
Jones- or Stokes-vector descriptions. As we saw above, 



only the right-isoclinic rotations (35) of the 4d i?-field 



vector have a physical realization in terms of the corre- 
sponding Jones and Mueller matrices. Obviously the Ai- 
component gives a constant phase shift of the field vector, 
so that has a clear physical interpretation. However, the 
A2— and A3— components describe transformations that 
are not used within the classical Jones- or Stokes-Mueller 
calculus, and the reason is simply that they cannot be 
realized as physical transformations of the electromag- 
netic field. In fact, a propagating electromagnetic wave 
cannot undergo such a change, irrespective of the phys- 
ical realization. The proof of this statement lies in that 
those state changes will not satisfy the bosonic commu- 
tation relations, as will be motivated in next section and 
appendix [D] We will therefore refer to those transforma- 
tions as forbidden rotations. We should emphasize that 
the existence of the forbidden rotations is not a short- 
coming of the known polarization calculi. It is merely a 
richer property of the 4d real space that is not needed 
for the description of propagating waves, but which can 
be taken advantage of if we understand it. For exam- 
ple if we can take the absolute phase rotation (which in 
terms of 4d rotations is given by the exp(ic/?Ai)-matrix) 
to Stokes space, we may find a natural extension of the 
absolute phase rotations to Stokes space, a problem that 
researchers have worked on previously |34) . 



A first clue to an understanding of the forbidden rota- 
tions will be to go back to the Jones- and Stokes-Mueller 
formalisms, and attempt to express these transforma- 
tions there. We will start by describing how the Jones- 
vector formalism can be extended. 



VI. THE FORBIDDEN ROTATIONS IN JONES 
SPACE 

The forbidden (left-isoclinic) transformation acting on 



the 4d vector A 



is 



Amt = cxp(-/3£ • X)A h 



(37) 



where subscript denote input/output states. If we write 
this in complex form, by collecting the proper real and 
imaginary parts, it can be expressed as 



U(0) 



(38) 



where U is the unitary transformation defined in (Jsj) . 
Thus, we obtain the a unitary transformation as in sec- 
tion 2, but now acting on the vector [a x ,a*]*, i.e. the 
Jones vector with the y-component conjugated. Thus, 
to describe the left-isoclinic transformations with Jones 
calculus, we need to redefine the Jones state vector. Such 
transformation vectors arise e.g. in the context of para- 
metric amplifiers [35j , but then the transfer matrix is not 
unitary. In fact, in appendix D we show that such a 
transformation is only consistent with the quantum me- 
chanical formulation of polarization operators if the two 
components /?2 = /?3 = 0, for which (38 1 reduces to 






exp(-i/?i) 




(39) 



which is the same phase rotation applied to both field 
components. Therefore changes described by the A2 and 
A3 rotation generators are indeed forbidden for the elec- 
tromagnetic field. However this does not preclude a 
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TABLE II. Overview over the use of the 16 Dirac matrices in 
terms of the matrices used in the generalized Stokes calculus. 



Doo = 1 


D01 = — S33 


D02 = S32 


D 03 = iXi 


-Dio = sn 


Dn — — S22 


D\2 = — S23 


D13 = ipi 


D 2 o = S21 


D21 = S12 


-D22 = Sl3 


D23 = ipi 


Dzo = -ip 3 


D31 = -i\ a 


-D32 = — i\2 


D33 = -S31 



mathematical description of them. In the following dis- 
cussion we will describe the right (left-) isoclinic rotations 
with the dimcnsionlcss parameters a (f3). 

We may unify the left- and right-isoclinic transforma- 
tions in an elegant way by replacing the Jones vector with 
a Jones state matrix 



E 



(40) 



With this definition a right-isoclinic (i.e. the conven- 
tional) transformation is performed by applying a unitary 
transformation to the two column vectors. Similarly, a 
left-isoclinic (comprising the phase and forbidden) trans- 
formation is realized by applying a unitary transforma- 
tion to the row vectors. The first column vector is the 
conventional Jones vector and the second is the corre- 
sponding orthogonal state of polarization, and obviously 
these two must transform with the same unitary matrix, 
so the transformation UE is just a trivial extension of the 
standard Jones analysis. However, the left-isoclinic rota- 
tion are described by a unitary operation on the trans- 
posed matrix, i.e. by UE* = (EU 1 )*. Thus we can de- 
scribe the left-isoclinic transformations by left multiply- 
ing the state matrix with a transposed unitary matrix. 
The full input-output relation for the general 4d rotation 
becomes 



E out = R(a,/3)E il 



(41) 



where R is given by (27), can thus be compactly ex- 



pressed by the Jones state matrix as 

-Eout = U(d)E ia U0Y 



(42) 



where again U denotes the unitary matrix as defined in 
^ . In this way we can recover the full 6-parameter rota- 
tion family by left and right multiplying the Jones state 
matrix with unitary matrices. It is noteworthy that the 
Jones state matrix is of unitary form, but with determi- 
nant — P, which means that its unitary structure (40) is 
preserved by the transformation (42), as expected. The 
commutation between right- and left-isoclinic rotations 
is also evident in (42), due to the associativity of matrix 
products. 



VII. THE GENERALIZED STOKES CALCULUS 

We saw that the 4d rotations could be described in 
Jones space by extending the Jones vector to a matrix 



that is transformed by right- and left multiplication by 
unitary matrices. The same thing happens when we 
try to connect the forbidden rotations with the Stokes- 
Mueller description. We divide this section into two 
parts; first we present definitions and properties; and sec- 
ond follows a few basic examples. 



A. The Stokes state matrix 

In order to describe the forbidden rotations, we need to 
replace the Stokes vector with a 3 x 3 matrix E, which we 
will refer to as the Stokes state matrix, defined in terms 
of the complex field components e x and e y as 



E = 



(\e x \ 2 -\e y \ 2 
2Re(e x e* y ) 
^-2Im(e x e;) 



2Re(e x e y ) 
-Rc( e 2 
lm{e 2 x -} 



2lm(e x e y ) 
-Im( e 2 - e 2 y 
-Rc( e 2 



4), 



(43) 

By using the 4d vector E and the Dirac matrices 
defined in Appendix B we can also express the Stokes 
state matrix as 



Ejk — E t SjkE 



(44) 



where Sjk is a 3 x 3 matrix with components given by the 
4x4 Dirac matrices as 



(45) 



With these definitions we can, cf. Appendix [E] for de- 





( D w 


D21 


D22 


s jk = 






-D l2 




\-d 33 


D02 


-An 



tails, express the general 4d rotation (41 ) as 
E out = M(a)E in M0Y 



(46) 



where subscripts i n / ou t refers to the input and output 
states. We note that the right- (left-) isoclinic rotations 
is obtained by right- (left-) multiplication of the Stokes 
matrix with the corresponding rotation matrix. It is 
an interesting observation that the 6 rotation generators 
(21)-(26) together with the above 9 Stokes matrix com- 
ponents Sij and the unity matrix comprise all 16 Dirac 
matrices. This dependence is summarized in Table [Tl] 
The transformation (46) shows that the classical 



(right-isoclinic) transformations acts by rotating the col- 
umn vectors by the rotation matrix M(a), in full com- 
patibility with the classical Mueller-Stokes description in 
( 12 -reforth) . The novelty is the phase and forbidden (left- 
isoclinic) transformations, that acts on the row vectors 
instead via right multiplication of the rotation matrix 

M0y. 

The Stokes state matrix E is a very interesting gener- 
alization of the Stokes vector e. Some properties worth 
highlighting are: 



• The first column, 
Stokes vector e, see 



Eji equals the conventional 
p|, which does not depend 
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on any common phase of the electromagnetic field 
components (which we refer to as the absolute 
phase). 

• We define the vectors / and g from the remaining 
two column vectors, Ej% = / and Ej 3 — g, which 
are alternative Stokes vectors that, in absence of 
left-isoclinic transformations (j3 — 0), transform 
exactly as the traditional Stokes vector e = Ej±. 
The existence of such alternative Stokes vectors is 
believed to be reported here for the first time. The 
reason is probably that they involve the absolute 
phase of the field, that is not easily measurable in 
experiments. 

• E is an orthogonal matrix, with the three column 
vectors [e, f,g\ (or the three row vectors) mutually 
orthogonal. The length of each row or column vec- 
tor equals the power, P — \e x \ 2 + \e y \ 2 . 

• The triplet [e, /, g] forms a right-handed coordinate 
system; i.e. the determinant det(E/P) = 1. Any 
right-isoclinic transformation can be seen as a ro- 
tation of this coordinate system. 

• The triplet formed by the row vectors 
[Eij, E-ij, E^j] is another right-hand system that is 
transformed by the left-isoclinic transformations. 

• The mapping between (i) the Jones state matrix 
(and 4d vectors) to (ii) Stokes state matrix is 2- 
to-1, in that both the vectors E and —E will be 
mapped to the same Stokes state matrix. This is 
obvious from the defi nition being proportional to 
E times itself, cf. ( 44 1 . 



We may thus see that this is an appealing extension of the 
traditional Stokes vector to account for absolute phase 
changes via the vectors / and g. More mathematical 
properties of the Stokes state matrix, an explicit rela- 
tion to the Jones state matrix, and matrix-exponential 
expressions for the Jones and Stokes state matrices are 
given in appendix |F} 

We en d th is subsection by generalizing the projection 
formula ( 15 ) to a 4d vectors E. In appendix [b we derive 



the relation (BIO) 



E*E = 



(47) 



which can be seen as a generalization of the 2d coherency 
relation ( 14 ) to 4d by using the Stokes state matrix E and 



its elements Eij. We can use this formula to derive the 
scalar product relation for two different 4d vectors E\ 
and E 2 , as 

{E X ■ E 2 f = ^ + «Va + /i-/2+ft-& (48 ) 

where the last terms are the scalar products of the Stokes 
vector and the two alternative Stokes vectors for the re- 
spective fields indicated by subscripts. 



B. Using the Stokes state matrix 

A first example is the Stokes state matrix of a linearly 
polarized wave with common phase <f>, i.e. the Jones vec- 
tor [cos(0), sin(6*)]* exp(z0). Its generalized Stokes vectors 
are 

e = [cos(2(9),sin(2(9),0]* 

/ = [sin(20) cos(20), - cos(26») cos(2<?(>), sin(20)]* 
g = [sin(20) sin(20), - cos(26>) sin(20), cos(20)]* 

from which we can see that the absolute phase <f> affects 
the direction of the vectors / and g in the plane normal 
to e. Thus the direction of / and g is associated with the 
absolute phase of the state. 

As another example we consider the phase shift of 
eigenstate propagation, not visible in the classic Stokes- 
Mueller calculus. Consider the simplest linearly birefrin- 
gent medium, retarding the x vs. y components a phase 
<t>. The corresponding Jones matrix is 



T = ( exp(i0/2) 



exp(-i</>/2) 



(49) 



showing that for eigenstate (in this case x or y-polarized) 
waves, a pure phase retardation of ±xf> is incurred, with- 
out changing the state of polarization. The correspond- 
ing Mueller matrix is (the right isoclininc) 



M 



fl 

cos(0) 
y0 sin(0) 



\ 

— sin(</>) 
cos(0) J 



(50) 



which multiplies the Stokes state matrix E = [e, /, g] 
from the left. For x (y)-polarized light, e = [1,0,0]* 
(e = [—1,0,0]*) so that / and g lie in the Si = 0-plane. 
Assuming the absolute phase to be zero (which is no re- 
striction) leads to the E = diag(l, —1, — l)for x-polarized 
light and E — diag(— 1, 1, —1) for the y— polarized case. 
When analyzing the movement of / and g for the two 
cases, we can see that the direction of rotation is oppo- 
site, even if the rotation matrix is the same, since e and 
/ have been inverted. 

As a final example on the use of the Stokes state matrix 
in a transformation system we show a pure (left-isoclinic) 
phase rotation, (a = 0, /3 = (2</>, 0, 0) ) affecting the 
initial Stokes state 



E h 



which is right-hand circular polarization at zero absolute 
phase. After insertion in (46 1 we obtain the output state 




E n 



r -sin(2<£) cos(20) 

-cos(20) -sin(2</>) 

1 



10 



Column Stokes vectors 



Row Stokes vectors 





FIG. 1. Colour) Plot of the column (left) and row (right) vectors o f th e Stokes matrix for an initial circularly polarized wave 
with e x = and e y — i/v2, affected by a left-isoclinic rotation (39 1 with (3 = (2^,0,0), i.e. the same constant phase shift 



tj> in both field components. The phase shift (which in Stokes space gives twice the angular excursion) changes from to 0.5 
rad in steps of 0.1. The first, second, and third row/col vectors are colored black, red, and blue, and their initial states are 
marked with a star, circle and cross, respectively. This kind of polarization transformation which is an absolute phase shift 
will rotate the column Stokes vectors (left plot) around Ej\ vector. The row Stokes vectors (right plot), on the other hand will 
rotate around the first column axis. 



In Fig. [TJ left, we plot the Stokes state matrix col- 
umn vectors [Eji, Ej 2 , Ej 3 ] — [e, f,g\ and in Figlll right, 
we plot the row vectors [Eij,E2j,E 3 j], as the phase 4> 
changes from to 0.5 rad in steps of 0.1. In other 
words we plot the Stokes state matrix for the Jones vec- 
tor [exp(i0),iexp(^)]Vv^- In Fig. |l] left) we can see 
the standard Stokes vector (black) e remaining fixed, as 
it is not affected by absolute phase rotations, whereas the 
/ and g vectors rotate around it. The row vectors on the 
other hand rotate around the 1-axis, as can be expected 
from the rotation operator M((2(f>, 0, 0)). 

Thus we can conclude the following fact from the above 
examples: An absolute rotation <j) of the optical wave 
leads to a rotation of /and g an angle 2<j> around e. With- 
out the alternative Stokes vectors this rotation would not 
be easily visualized within in the Stoke-Mueller calculus, 
although attempts have been made in that direction pre- 
viously [53 1 156 ] . 



effect as well. 

Both effects can be intuitively described by the follow- 
ing well-known geometrical fact: Move a vector /, par- 
allel to the surface of a sphere, around a closed loop C 
on the sphere and back to the starting point, and denote 
that vector /'. The movement should be so called parallel 
transport, i.e., a translation without rotation relative to 
the local coordinate system set up by the coordinate vec- 
tor, its velocity vector and their cross product. Then / 
and /' will make an angle O, which equals to the solid an- 
gle (or unit sphere area) that C encloses. On a Euclidean 
2d (non-curved) surface / and /' will obviously coincide, 
but on a curved surface like that of a sphere they will 
deviate. This is illustrated in Fig[2j This phenomenon of 
parallel transport is often used to define curvature, e.g. 
in the context of general relativity and the study of met- 
ric spaces. We will now discuss the two effects in more 
detail. 



VIII. THE PARALLEL TRANSPORT 
PHENOMENA 



A. The Pancharatnam phase 



We will now, as an example of the use of the Stokes 
state matrix, describe the Pancharatnam phase effect. 
Since it is often mentioned (and sometimes mistaken for) 
another phenomenon of similar origin, the Berry phase, 
we will for completeness compare and contrast with this 



In 1956 Pancharatnam [551I57] wrote two groundbreak- 
ing papers with the main purpose of analyzing inter- 
ference of light with different polarization states. The 
first paper |36j dealt with coherent light, and extends 
the the Fresnel-Arago interference laws to the case of op- 
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FIG. 2. Parallel transport of the orthogonal vector triplet 
E = [e, /, g\ where eis radially directed and /, g are tangential 
to the sphere surface in the starting point (at the north pole). 
After parallel transport (i.e. translation without rotation) 
of the triplet around the closed loop C, the triplet [e* , f ,jf] 
where e = e* is formed. The angle between / and /' (and also 
between g and g") equals Q, the solid angle subtended by C. 



tical waves with arbitrary states of polarization. The 
main result was that for interfering a polarization state 
S\ with another state S2 one must decompose state S2 
into one part parallel with state S\ and one part parallel 
with the orthogonal polarization —Si. This projection 
involves a phase shift that can be expressed in terms of 
the area of a spherical triangle formed by the involved 
polarization states and their orthogonal counterparts on 
the Poincare sphere. An important consequence of Pan- 
charatnam's analysis, pointed out by Berry [38], can be 
stated as follows: If a polarized wave with Stokes vector 
S evolves through a system so that its polarization state 
traces out a closed path C with solid angle il on the 
Poincare sphere, then the starting and finishing states 
will have an absolute phase difference equal to f2/2. 

Instead of using Pancharatnam's spherical trigonom- 
etry, this can now be readily explained by using the 
orthogonal vector triplet [e,f,g\, and the phenomenon 
of parallel transport on a sphere. The vector e is the 
conventional Stokes vector, orthogonal to the Poincare 
sphere surface, and it will trace out C. The two alterna- 
tive Stokes vectors / and g will be parallel transported 
around C, and according to the parallel transport phe- 
nomenon make an angle il with their initial counterparts. 
As shown in the previous section, the angular difference 
between the /-vectors of copolarized waves will be half 
the absolute phase difference, i.e. £1/2. In this way the 
Stokes state triplet [e, f,g\ becomes a useful tool to de- 
scribe the Pancharatnam phase change. 



B. The Berry phase 

The Berry phase |39| is a very general concept, arising 
in a number of different physical settings where systems 
evolve one period while changing a characteristic phase 
adiabatically. The Aharanov-Bohm effect and Foucault's 
pendulum are but two manifestations [40] . In our con- 
text of the polarization of light it refers to the polariza- 
tion change for light propagating along a curved three- 
dimensional path 0T]. It can be explained as follows. 
Consider planar polarized light propagating into a sys- 
tem with direction (wave vector) k taking it through an 
arbitrary path in three dimensions (e.g. a coiled optical 
fiber) , and then leaving the system in the same direction 
k. During its propagation, k traces out a closed path C 
on the unit sphere, and the solid angle fi of C will exactly 
equal the angular change of the polarization plane from 
input to output [41] . 

Also this phenomenon can be explained by paral- 
lel transport, but this time the orthogonal triplet is 
[kz, e x x, e y y], i.e. the wave vector and the two transverse 
polarization components. Again the parallel transport 
phenomenon will act by rotating the polarization com- 
ponents (or more specifically the transverse coordinate 
system) an angle fi. Contrary to Pancharatnam's phase, 
which is an absolute phase change induced by a polar- 
ization evolution, the Berry's phase is a polarization 
change induce by a changes in propagation direction. 
Although the Berry phase for lightwave propagation 
was extensively investigated in the late 80:ies, it has 
a much older history. In fact, this purely geometrical 
effect was originally discovered by Rytov already in 
1938 [32] for general electric E and magnetic H field 
vectors, and specifically evaluated for polarization states 
by Vladimirskii three years later [43]. The effect is 
of major practical importance as relating mechanical 
perturbations to polarization changes in fibers, and it is 
one of the most significant sources to polarization drift 
in fiber communications. 

IX. DISCUSSION AND CONCLUSIONS 

In Table Mil we summarize the connection between 4d 
rotations, and the generalized Jones- and Stokes-Mueller- 
calculi, which can be seen as the main contribution of this 
article. The underlying reason that 4d rotations can be 
modeled as 2 separate 3d rotations in this way is the iso- 
morphism between the S'0(4) group and two 3d rotation 
groups, O3 x O3", as is well-known in group theory [2"5] . 
However this is to our knowledge the first time this iso- 
morphism have been explicitly connected to polarization 
analysis. 

In the introduction we mentioned three properties of 
these results worth emphasizing; extension, commuta- 
tion, and synthesis. We will discuss them individually 
below. 
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TABLE III. Extension of the conventional Jones and Stokes-Mueller calculii to account for the full 4d rotations, characterized 
by the 2 real three-vectors a and /3. The vectors B, p, X are three-vectors formed by the Pauli spin matrices, and the left- and 
right-isoclinic 4d rotation generators, respectively, which are defined in Eqs. ( 2lf26 l, The unitary matrix U is defined in Q 
and the orthogonal matrix M in 1 13 1. The case shown in table [i] is recovered as the special case (3 = (qq, 0, 0). 





Signal State 


Transformation 


Transfer Matrix 


Jones 


-1 


ye y —e X/ 


) 


Bout = U(a)E- m U0Y 


U (a) — exp[— id ■ a] 
U0) =exp[-0- 3} 


Stokes-Mueller 


(\e x \ 2 — \e y \ 2 2Re(e x e y ) 2lm(e x e y ) \ 
E = 2Re(e x e y ) —Re(e x — e 2 ) — Im(e^ — ejj) 
\- 2Im(e a: eJ) Im(e| + ej) -Re(e 2 x + e 2 v ) ) 


E out = M(a)E in M0y 


M(a) = exp[2ax] 
M0) = exp[2/3x] 


Four-dim. 


E = 


/Re(e x )\ 
lm(e x ) 
Re(e y ) 

\Im(e y )J 




E ou t = REi n 


R = exp[— /3 • X — a - p\ 



A. Extension 

There are two aspects of extension of the existing po- 
larization calculi presented in this paper. The first is 
the mapping to real 4d geometry of the Jones calculus, 
which is presented here for the first time. This should be 
of value in communications, where a multidimensional 
real formalism is often preferred over complex analysis. 

The second aspect is the extension of the existing Jones 
and Stokes-Mueller calculi to account for the left-isoclinic 
transformations. This were made possible by the intro- 
duction of the state matrices rather than state vectors. In 
the case of Stokes calculus this also enabled the absolute 
phase changes to be visualized on the Poincar sphere. 
The idea to model absolute phase changes as rotations 
around the classical Stokes vector was actually proposed 
previously by Frigo and Bucholtz [33] (and probably un- 
derstood by Pancharatnam [35]) but it is interesting to 
see that it emerges as a natural consequence of modeling 
the full 4d rotation transformation set. This extension of 
the conventional Stokes-Mueller calculus to account for 
absolute phase should be quite useful, since it does not 
alter the conventional definitions of the Stokes vectors, 
but appear as a mere extension of it. In addition it pre- 
serves some of the appealing 3d properties of the Mueller 
calculus, by modeling birefringence as Poincare sphere 
rotations. 



B. Commutation 

The fact that right- and left-isoclinic transformations 
commute is a unique property of the 4d rotations that 
greatly facilitates analysis and system realization of the 
forbidden rotations. Obviously it has been known, al- 
most trivially, that absolute phase changes and polariza- 
tion changes commute, but now we put it on a more firm 
mathematical ground also in Stokes space. The finding 



of the forbidden transformations, and the fact that those 
will commute with polarization (right-isoclinic) transfor- 
mations, but not commute with absolute phase changes 
is also intriguing and of potential use in future applica- 
tions in the communications area. 



C. Synthesis 

An obvious question regarding the discovery of the for- 
bidden transformations is: so what? If they are unphys- 
ical we need not care, right? Well, the fact that they 
cannot occur during photon propagation does not pre- 
vent them from being synthesized artificially. A coher- 
ent detector followed by signal processing and a coherent 
transmitter will enable any transformation of the elec- 
tromagnetic wave. To be able to artificially synthesize 
a transformation that does not affect the signal power 
(since it is a rotation) and that cannot be undone by wave 
propagation may have interesting applications in commu- 
nications, imaging, spectroscopy and signal processing. 

It is illustrative to describe a forbidden rotation in 
geometrical terms. The rotation described by circular 
birefringence is described by the 4d rotation exp(— 4>p§) , 
which in 4d is the double rotation D c (see appendix [C] for 
the exact rotation matrix) that rotates the two real and 
the two imaginary vector components in the same direc- 
tion. The corresponding forbidden rotation exp(— 4>Xz) 
rotates the real and imaginary parts of the two vector 
components in opposite directions, which is clearly un- 
physical for wave propagation, but not impossible to syn- 
thesize. 



D. Summary and outlook 

To summarize, we have presented an extension of the 
conventional polarization analysis to cover the full de- 
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grees of freedom of real four-dimensional rotations. We 
have identified physical as well as unphysical degrees of 
freedom in the process. Future work on this theory could 
be to extend it to a full polarization calculus, accounting 
also for polarizing elements with polarization dependent 
loss or gain, as well as looking in to applications for the 
forbidden transformations. 
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complex 2x2 matrix C can be written as 
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C = Y,Ck<Jk (A3) 

fc=0 
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C = ee+ = Cx6x ex£ y 



the coefficients Ck are given by 

c k = Tr(ee+cr fe )/2 = {e+a k e)/2. 



(A5) 



(A6) 



so we obtain the relation between Jones and Stokes vec- 
tors 

+ P + e- a 
ee+ = . (A7) 



as originally pointed out in [TTJ [T3] . 



Appendix B: The Dirac matrices 



Appendix A: The Pauli matrices 

As is common in polarization work [551 US] we define 
the Pauli matrices in (for physics literature, see e.g. [2H 
Ch. 4.5]) nonstandard order, to comply with the conven- 
tional Stokes vector definition used in the classic optical 
text as, e.g. [Hj. Thus the Pauli spin matrices are de- 
fined by 

" = (o ?)' CT1 = (o 

-(:;)-(::)• <-> 

which can be formed into a 3-vector with 2x2 matrices 
as elements a — (o~±, (72,173), cf. [13] . The Pauli matrices 
are Hermitian and have zero trace (except for Tr(ao) = 
2). They satisfy the multiplication rules 

222 
Co = cr 1 = <j 2 = cr 3 

cr lC r 2 = ~a 2 <Ti = ia 3 (A2) 

where the last equation allow for cyclic permutation of 
indices. This also shows that each Pauli matrix (exclud- 
ing (To) anticommutes with the other 2 Pauli matrices, 
and commutes with itself and Co- The Pauli matrices 
are linearly independent and can be used as a basis for 
all complex 2x2 matrices. In other words, an arbitrary 



The Dirac matrices Dj k is a useful basis for 4x4- 
matrices, and owe their name to Paul Dirac who used 
such matrices in his analysis of the dynamics of relativis- 
tic electrons (the Dirac equation). However, we will not 
follow the standard 7 notation for these matrices used in 
relativistic electrodynamics [25], but define them as the 
Kronecker product of two Pauli matrices [HI Ch 4.5], i.e. 
as 

D jk = Uj ® <r k (Bl) 

which means that there are 16 Dirac matrices, since j, k 
each ranges over 0,1,2,3. The 16 matrices are shown in 
table [GO 

From the definition and the Kronecker product rule 

(A®B)(C®D) = AC®BD (B2) 

(which holds for matrices ABC and D that can be mul- 
tiplied as indicated) one can derive a number of useful 
properties [22 Ch. 4.5]: 

1. Since the Pauli matrices are Hermitian, so are the 
Dirac matrices, i.e. they equal their conjugate 
transpose: 

Df k =D+=D jk . (B3) 

2. The Dirac matrices square to the unity matrix: 

D) k = a) ®o-l = a ®<T =I. (B4) 
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TABLE IV. The 16 Dirac matrices Djk = <Tj ® Ok- Note that the lower right 3x3 subset form a multiplication table, since 
DokDjo = Djk- The same table is given in [31], and (with redefined indices) in [221 Table 4.1]. 
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3. The Dirac matrices are traceless (except for the 
unity matrix), since Tr(Dy) 



TraiTrcTj = 4^ ^o- 



The product of two Dirac matrices equals to an- 
other Dirac matrix (within a constant phase factor) 



Dj k Dj, k , = (<JjOy) <g> ((Tfc(7fc') 



(B5) 



where the right hand side need to be evaluated after 
the Pauli product rules ( A2 1. 



The exponential of a Dirac matrix can be written 
in polar expansion form as 



exp(iaDjk) = cos(a)7 + % sin(a)Djk 



(B6) 



where a is a real constant. This follows from ex- 
pansion of the exponential to an infinite sum and 
then noting that the even terms are proportional 
to the unity matrix I and the odd to Dj k . 

6. Each Dirac matrix commutes with 8 Dirac matrices 
(two of these are D 00 and itself), and anticommutes 
with the other 8 Dirac matrices. The exception is 
Dqq which obviously commutes with all 16 matri- 
ces. 

7. Since all Dirac matrices are linearly independent, 
they form a basis for all 4 x 4-matrices. For example 
an arbitrary 4 x 4-matrix M can be written 



M 



(B7) 



where repeated indices (here and below) indicate 
summation, and are 16 complex constants. By 



multiplying M with D k i and taking the trace, only 
the iriki term remains (the others will have zero 
trace), and we can thus express the coefficients as 



m kl = Tr(MD kl )/A. 



(B8) 



This last relation will help us obtain a coherency relation 
similar to (A7). Consider the 4d "coherency matrix" 
F = EE*, where E is a real 4d vector. According to 
p7|, F = f kl D u where 



hi - Tr(FD k i)/4 = (E t D k iE)/A. 



(B9) 



However, since F is symmetric, the 6 elements f^ k and 
fj3 with i,j ^ 3 (corresponding to the antisymmetric 
Dirac matrices) will vanish, and in addition the power is 
given by 4/oo = P ■ Thus we can express the 4d coherency 
relation (in terms of the Stokes state matrix elements Eij 
as 



E t E 



P -\- Eij Sij 



(bio) 



where Ea. 



are nine real coefficients and Sjj 



denote the reordered Dirac matrices related to the Stokes 
state vector, defined in (45 1 and Table [TT] The reordering 
is done to have the conventional Stokes vector as the first 
column of the Stokes state matrix. In this way we have a 
useful generalization of ( A7 ) to 4d vectors and the Stokes 
state matrix. 
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Appendix C: The 4d rotations 

Much of the material in this appendix can be found in 
the Wikipedia article [S3], but it is given here for refer- 
ence and as an introduction. 

To classify the 4d rotations we start by defining the 
simple rotations, that rotate only two coordinate axes 
while leaving the plane spanned by the other two invari- 
ant. For example a simple rotation in the 1,2-plane is 



( cosO) sin(</>) 0\ 

-sin(» cos(» 

10 

\ 1/ 



(CI) 



which leaves the 3-4-plane invariant. Another one is 



T 24 (</>) 



(l \ 

cos(0) sin(0) 

10 

\0 -sin(» cosO)/ 



(C2) 



which leaves the 1,3-plane invariant. In this way we can 
define six simple rotations Ti 2 , T13, T14, T23, T24, T34 that 
span the full 6-DOF space of 4d rotations. 

The most general rotation in 4d is the double rotation, 
which leaves only the origin invariant. For example we 
can realize a double rotation as the product of two simple 
rotations as 



\ 





Ol>2) 



A,(0i,&) = Ti2(0i)T34(0a) = 

^ cos(</>i) sin((/>i) 

— sin(i/) 1 ) cos((/>i) 

cos(0 2 ) sin(0 2 ) 

\ -sin(^ 2 ) cos(0 2 )/ 

And we may similarly define -Dt>(</>i, 2 ) and D, 
in an analogous way as 

Db{4>\,4>2) = J 1 i 4 ((?!)i)T 2 3((/i 2 ) = 

/cosOi) -sin(</>i)\ 

cos(0 2 ) -sin(0 2 ) 

sin(0 2 ) cos(</> 2 ) 

Vsin(^i) cosOi) J 

and 



d c (0i,0 2 ) = r 13 (0i)r 24 (0 2 ) = 

^cos(^i) 


sin(</>i) 

V 



— sin((/)i) 

sin((/) 2 ) 
cosOi) 
cos 



cos(<£ 2 ) 


-sin(0 2 ) 

to get a set of 3 double rotations that span all six DOF. 



° ^ 



(<h)J 



A double rotation for which the two rotation angles are 
equal is called isoclinic. We will distinguish between the 
left-isoclinic, which have the two double rotations going 
in the same direction, and right-is oclinic where the rota- 
tion angles are in opposite directions. Thus -D Qj & jC (</>, </>) 
are the left- and £> a ,fc,c(0, — <t>) are the right-isoclinic ro- 
tations. Any simple rotation can thus be described as 
the product of a right- and a left-isoclinic rotation with 
the same angles. 

The left- and right-isoclinic rotations form two 3-DOF 
subgroups, i.e. any sequence of left-isoclinic rotations will 
always remain left-isoclinic, and vice versa. The really in- 
teresting property of the isoclinic parameterization, how- 
ever, is that any left-isoclinic rotation commutes with 
any right-isoclinic. Rotations within each subgroup do 
not commute, however. This enables any 4d rotation to 
be expressed as a (commuting) product of one right- and 
one left-isoclinic rotation. 

The underlying group theoretical reason for this is that 
the 4d rotation group, SO{4) is isomorphic with the prod- 
uct of two 3d rotation groups x . In other words, 
each of the two isoclinic groups can be mapped to the 
set of real 3d rotations. To present such a mapping is, in 
fact, the underlying purpose of this article. 



Appendix D: Left-isoclinic photon transformations 
are (mostly) forbidden 

Dr. Colin McKinstrie suggested the derivation in this 
appendix to me. 

We make the customary extension (similarly to e.g. in 
[151 ITT] ) to quantum mechanics where the electromag- 
netic field amplitudes e x and e^correspond to the quan- 
tum mechanical operators tp x and tp v , which obey the 
boson commutation relations 



(Dl) 



for j, k £ {x,y}. We will now study input-output trans- 
formations for these operators, and the requirements the 
commutation relations put on the transformation matri- 
ces. We consider left- and right-isoclinic transformations 
separately. 

Right-isoclinic transformations: This is the conven- 
tional input-output unitary transformation 




7 V \ I i>xi 
-rf 7*i Uj) yi 



(D2) 



where subscripts i, o correspond to input, output, and 
the unitary property relates the complex transformation 
coefficients 7 and 77 via 



171 



r'/l 



= 1. 



(D3) 



If we now apply the commutator relations (Dl| to the 



output operators, we find that the transformation coef- 
ficients must obey \^\ 2 + |?y| 2 = 1, i.e. the commutator 
relations are satisfied for all right-isoclinic transforma- 
tions of the polarization operators. 
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Left-isoclinic transformations: This transformation is 




where again unitarity requires (D3), but the lower-row 




(D4) 



vector components are conjugated, in contrast with the 
reight-isoclinic counterpart. If we now apply the com- 



mutator relations (Dl| to the output operators of (D4|, 



we find that the transformation coefficients must obey 



111' 



| r\ | = 1, which together with (D3) leads to 



M = l 



(D5) 
(D6) 



which physically corresponds to the same phase shift for 
both polarization components. This means that the left- 
isoclinic transformation is only possible if 77 = 0. 

To conclude, we showed that the quantum mechani- 
cal commutation relations are fulfilled for right-isoclinic 
transformations, and left-isoclinic transformations that 
are pure phase shifts (i.e. have 77 = in (D4) 
remaining two DOF (77 7^ 0) for the left-isoc 
formations are thus forbidden 
transformations. 



mic trans- 
or nonphysical, photon 



Appendix E: Stokes-Mueller calculus for right- and 
left-isoclinic rotations 



In the following we will sketch a derivation of the gen- 
eralized Stokes calculus for the combined case of right - 
and left-isoclinic rotations. Our starting point is the 4d 
evolution equation that can be written as 



dE 



-KE = ~(h a ■ p+hp- X)E 



(El) 



where h a parameterizes the right-isoclinic rotation, and 
hp the left isoclinic one. The matrix K is the most gen- 
eral antisymmetric 4x4 matrix, para mete rized in t erms 
of the vectors p and A as defined after ( 23 ) and ( 26 ) . By 



using the definition of the Stokes matrix ( 45 ) we can use 



(El) to express its differential equation as 



dE k 
dz 



= -E l 



h K]E 



(E2) 



for each element Eij in the Stokes state matrix E, and 
where [A, B] = AB — BA denotes the commutator. It 
may now be shown (by straightforward but lengthy in- 
spection) that the commutators of the matrices Sy and 
the rotation generators satisfy 

[ s ijtPk] = —2£iklSlj 



[Sij, Afe] 



(E3) 

ij, 'vsj — — Ztjkisu (E4) 

where ey^ is the Levi-Civita tensor. We recall that the 
cross product between two vectors a and b can be written 
in tensor form 



a x b — a-jbue 



j Ok ^ij k 



(E5) 



state matrix 



Thus ( E2 ) can be written as an equation for the Stokes 
(2h a x)E + E(2h p x) t (E6) 



dE 
dz 



which has the solution 

E{z) = M(2zh a )E(0)M(2zhpzx) t = 

M(a)E(Q)M0Y (E7) 



where M is the 3d rotation operator defined in ([13]) , a = 
zh a and j3 = zhp. This concludes the derivation of the 
input-output relation for the Stokes state matrix. 



Appendix F: Relations between the Jones and 
Stokes state matrices 



The Jones state matrix E is defined with the conven- 
The tional Jones vector ( x ) in the first column, and its or- 



y 



thogonal state in the second, i.e. 



E 



(Fl) 



The signal power is given by P — \e x \ 2 + \e y \ 2 = —det(E). 
Being a unitary matrix it can be expressed using the ma- 
trix exponential of a Hermitian matrix as follows. First 
define the real vector p = [Re(e x ), Ke(e v ),lm(e y )} = \p\p, 
where p is the corresponding unit vector. Then we have 



(F2) 



where cos(</>) = — Im(e x )/vP. 

We can do the same with the Stokes state matrix E. 
Being orthogonal it can be expressed as the matrix ex- 
ponential of a skew symmetric matrix as 



E = (— i)v P exp[i(fi p ■ a] 



E = P cxp(-20 px) 



(F3) 



We can also give an explicit relation between the two 
state matrices, namely EjkO~k = E + o-jE, or more explic- 
itly 



E lk a k = E+a 1 E 
E2kO~k = E + a%E 
EskO~k = E + a 3 E 



(F4) 



where the RHS are scalar products between a and each 
of the the real row Stokes vectors Ex^E-xk or E 3 k- The 
LHS are the product of three 2x2 matrices. It is note- 
worthy that these relations are mathematically similar to 
the relation between conventional Jones matrices and 3d 
Mueller matrices, see e.g. Ch. 2.6]. In terms of the 
vectors [e,f,g\ we can write this as 



(efe,/fe,5fc) ' & = E + a k E, 



fee {1,2,3} (F5) 
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where e.g. refers to the fc:th vector component of /. 



Finally, for reference it is useful to express the Stokes 



state matrix in terms of the Jones vector phases in nor- 
malized form, i.e. for a normalized Jones vector e = 



cos(0) exp(«0 x ) 
sin(0) exp(i(j)y) 



the Stokes state matrix becomes 



cos(2#) sin(20) cos((f> x + 4>y) 

E = | sm(20) cos((/) x — (f> y ) — cos 2 (#) cos{2<f> x ) + sin 2 (#) cos(2<^) 
— sm(20) sxa((/) x — <f> v ) cos 2 (9) sm(2(j) x ) + sin 2 (9) sin(2</> y ) 



sin(20) sm(4> x + <j) y ) 
- cos 2 (9) sin(2^a;) + sin 2 (#) sin(2^ y ) 
cos 2 (0) cos(2^a;) - sin 2 (#) cos(2<j) y ) 



(F6) 
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